Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 6736 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 789.5 KiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 5 |
TIME is highly correlated with S and 13 other fields | High correlation |
S is highly correlated with TIME and 13 other fields | High correlation |
T1 is highly correlated with TIME and 13 other fields | High correlation |
T4 is highly correlated with TIME and 13 other fields | High correlation |
T5 is highly correlated with TIME and 13 other fields | High correlation |
T9 is highly correlated with TIME and 13 other fields | High correlation |
T10 is highly correlated with TIME and 13 other fields | High correlation |
T11 is highly correlated with TIME and 13 other fields | High correlation |
T12 is highly correlated with TIME and 13 other fields | High correlation |
Z is highly correlated with TIME and 13 other fields | High correlation |
T2 is highly correlated with TIME and 13 other fields | High correlation |
T3 is highly correlated with TIME and 11 other fields | High correlation |
T6 is highly correlated with TIME and 11 other fields | High correlation |
T7 is highly correlated with TIME and 11 other fields | High correlation |
T8 is highly correlated with TIME and 11 other fields | High correlation |
TIME is uniformly distributed | Uniform |
TIME has unique values | Unique |
S has 728 (10.8%) zeros | Zeros |
Z has 80 (1.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-11 03:28:23.353863 |
|---|---|
| Analysis finished | 2022-11-11 03:28:29.432437 |
| Duration | 6.08 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 6736 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 280.625 |
| Minimum | 0 |
|---|---|
| Maximum | 561.25 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 28.0625 |
| Q1 | 140.3125 |
| median | 280.625 |
| Q3 | 420.9375 |
| 95-th percentile | 533.1875 |
| Maximum | 561.25 |
| Range | 561.25 |
| Interquartile range (IQR) | 280.625 |
Descriptive statistics
| Standard deviation | 162.0550032 |
|---|---|
| Coefficient of variation (CV) | 0.5774788534 |
| Kurtosis | -1.2 |
| Mean | 280.625 |
| Median Absolute Deviation (MAD) | 140.3333333 |
| Skewness | -1.33126377 × 10-16 |
| Sum | 1890290 |
| Variance | 26261.82407 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 372.8333333 | 1 | < 0.1% |
| 374.8333333 | 1 | < 0.1% |
| 374.75 | 1 | < 0.1% |
| 374.6666667 | 1 | < 0.1% |
| 374.5833333 | 1 | < 0.1% |
| 374.5 | 1 | < 0.1% |
| 374.4166667 | 1 | < 0.1% |
| 374.3333333 | 1 | < 0.1% |
| 374.25 | 1 | < 0.1% |
| Other values (6726) | 6726 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.08333333333 | 1 | |
| 0.1666666667 | 1 | |
| 0.25 | 1 | |
| 0.3333333333 | 1 | |
| 0.4166666667 | 1 | |
| 0.5 | 1 | |
| 0.5833333333 | 1 | |
| 0.6666666667 | 1 | |
| 0.75 | 1 |
| Value | Count | Frequency (%) |
| 561.25 | 1 | |
| 561.1666667 | 1 | |
| 561.0833333 | 1 | |
| 561 | 1 | |
| 560.9166667 | 1 | |
| 560.8333333 | 1 | |
| 560.75 | 1 | |
| 560.6666667 | 1 | |
| 560.5833333 | 1 | |
| 560.5 | 1 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11055.98352 |
| Minimum | 0 |
|---|---|
| Maximum | 18000 |
| Zeros | 728 |
| Zeros (%) | 10.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 8000 |
| median | 12001 |
| Q3 | 12001 |
| 95-th percentile | 18000 |
| Maximum | 18000 |
| Range | 18000 |
| Interquartile range (IQR) | 4001 |
Descriptive statistics
| Standard deviation | 5092.430336 |
|---|---|
| Coefficient of variation (CV) | 0.4606040093 |
| Kurtosis | 0.1699434198 |
| Mean | 11055.98352 |
| Median Absolute Deviation (MAD) | 4001 |
| Skewness | -0.6083629808 |
| Sum | 74473105 |
| Variance | 25932846.73 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 12001 | 2877 | |
| 18000 | 1412 | |
| 8000 | 1136 | 16.9% |
| 0 | 728 | 10.8% |
| 7999 | 302 | 4.5% |
| 9998 | 253 | 3.8% |
| 17998 | 27 | 0.4% |
| 11090 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 728 | 10.8% |
| 7999 | 302 | 4.5% |
| 8000 | 1136 | 16.9% |
| 9998 | 253 | 3.8% |
| 11090 | 1 | < 0.1% |
| 12001 | 2877 | |
| 17998 | 27 | 0.4% |
| 18000 | 1412 |
| Value | Count | Frequency (%) |
| 18000 | 1412 | |
| 17998 | 27 | 0.4% |
| 12001 | 2877 | |
| 11090 | 1 | < 0.1% |
| 9998 | 253 | 3.8% |
| 8000 | 1136 | 16.9% |
| 7999 | 302 | 4.5% |
| 0 | 728 | 10.8% |
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.84732779 |
| Minimum | 24.8 |
|---|---|
| Maximum | 26.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 24.8 |
|---|---|
| 5-th percentile | 24.8 |
| Q1 | 25.5 |
| median | 25.8 |
| Q3 | 25.9 |
| 95-th percentile | 26.8 |
| Maximum | 26.8 |
| Range | 2 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.5819726713 |
|---|---|
| Coefficient of variation (CV) | 0.0225157771 |
| Kurtosis | -0.4272213702 |
| Mean | 25.84732779 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 0.2617425871 |
| Sum | 174107.6 |
| Variance | 0.3386921901 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=41)
| Value | Count | Frequency (%) |
| 25.5 | 1434 | |
| 25.9 | 1417 | |
| 26.8 | 1366 | |
| 25.8 | 1183 | |
| 24.8 | 552 | 8.2% |
| 25.7 | 244 | 3.6% |
| 24.9 | 118 | 1.8% |
| 25.6 | 112 | 1.7% |
| 25 | 67 | 1.0% |
| 25.4 | 37 | 0.5% |
| Other values (31) | 206 | 3.1% |
| Value | Count | Frequency (%) |
| 24.8 | 552 | |
| 24.85 | 2 | < 0.1% |
| 24.9 | 118 | 1.8% |
| 24.95 | 2 | < 0.1% |
| 25 | 67 | 1.0% |
| 25.05 | 1 | < 0.1% |
| 25.1 | 26 | 0.4% |
| 25.15 | 1 | < 0.1% |
| 25.2 | 22 | 0.3% |
| 25.25 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 26.8 | 1366 | |
| 26.75 | 2 | < 0.1% |
| 26.7 | 28 | 0.4% |
| 26.65 | 2 | < 0.1% |
| 26.6 | 20 | 0.3% |
| 26.55 | 2 | < 0.1% |
| 26.5 | 7 | 0.1% |
| 26.45 | 2 | < 0.1% |
| 26.4 | 5 | 0.1% |
| 26.35 | 3 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.8 KiB |
| 24.8 | |
|---|---|
| 24.7 | |
| 24.6 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26944 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24.8 |
|---|---|
| 2nd row | 24.8 |
| 3rd row | 24.8 |
| 4th row | 24.8 |
| 5th row | 24.8 |
Common Values
| Value | Count | Frequency (%) |
| 24.8 | 3823 | |
| 24.7 | 1527 | 22.7% |
| 24.6 | 1386 | 20.6% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 24.8 | 3823 | |
| 24.7 | 1527 | 22.7% |
| 24.6 | 1386 | 20.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 3823 | |
| 7 | 1527 | 5.7% |
| 6 | 1386 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20208 | |
| Other Punctuation | 6736 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| 8 | 3823 | |
| 7 | 1527 | 7.6% |
| 6 | 1386 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 3823 | |
| 7 | 1527 | 5.7% |
| 6 | 1386 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 3823 | |
| 7 | 1527 | 5.7% |
| 6 | 1386 | 5.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.8 KiB |
| 24.7 | |
|---|---|
| 24.8 | |
| 24.5 | |
| 24.6 | 70 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26944 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24.8 |
|---|---|
| 2nd row | 24.8 |
| 3rd row | 24.8 |
| 4th row | 24.8 |
| 5th row | 24.8 |
Common Values
| Value | Count | Frequency (%) |
| 24.7 | 3064 | |
| 24.8 | 2228 | |
| 24.5 | 1374 | |
| 24.6 | 70 | 1.0% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 24.7 | 3064 | |
| 24.8 | 2228 | |
| 24.5 | 1374 | |
| 24.6 | 70 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 7 | 3064 | |
| 8 | 2228 | 8.3% |
| 5 | 1374 | 5.1% |
| 6 | 70 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20208 | |
| Other Punctuation | 6736 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| 7 | 3064 | |
| 8 | 2228 | 11.0% |
| 5 | 1374 | 6.8% |
| 6 | 70 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 7 | 3064 | |
| 8 | 2228 | 8.3% |
| 5 | 1374 | 5.1% |
| 6 | 70 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 7 | 3064 | |
| 8 | 2228 | 8.3% |
| 5 | 1374 | 5.1% |
| 6 | 70 | 0.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.11469715 |
| Minimum | 24.9 |
|---|---|
| Maximum | 25.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 24.9 |
|---|---|
| 5-th percentile | 24.9 |
| Q1 | 25 |
| median | 25.1 |
| Q3 | 25.1 |
| 95-th percentile | 25.5 |
| Maximum | 25.5 |
| Range | 0.6 |
| Interquartile range (IQR) | 0.1 |
Descriptive statistics
| Standard deviation | 0.1829093358 |
|---|---|
| Coefficient of variation (CV) | 0.007282960041 |
| Kurtosis | -0.2383397748 |
| Mean | 25.11469715 |
| Median Absolute Deviation (MAD) | 0.1 |
| Skewness | 1.005919754 |
| Sum | 169172.6 |
| Variance | 0.03345582512 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 25 | 2485 | |
| 25.1 | 1816 | |
| 24.9 | 793 | 11.8% |
| 25.5 | 654 | 9.7% |
| 25.4 | 599 | 8.9% |
| 25.3 | 259 | 3.8% |
| 25.2 | 130 | 1.9% |
| Value | Count | Frequency (%) |
| 24.9 | 793 | 11.8% |
| 25 | 2485 | |
| 25.1 | 1816 | |
| 25.2 | 130 | 1.9% |
| 25.3 | 259 | 3.8% |
| 25.4 | 599 | 8.9% |
| 25.5 | 654 | 9.7% |
| Value | Count | Frequency (%) |
| 25.5 | 654 | 9.7% |
| 25.4 | 599 | 8.9% |
| 25.3 | 259 | 3.8% |
| 25.2 | 130 | 1.9% |
| 25.1 | 1816 | |
| 25 | 2485 | |
| 24.9 | 793 | 11.8% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.0959323 |
| Minimum | 25 |
|---|---|
| Maximum | 25.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 25 |
| median | 25 |
| Q3 | 25.1 |
| 95-th percentile | 25.4 |
| Maximum | 25.5 |
| Range | 0.5 |
| Interquartile range (IQR) | 0.1 |
Descriptive statistics
| Standard deviation | 0.1465102823 |
|---|---|
| Coefficient of variation (CV) | 0.005838009146 |
| Kurtosis | 0.3107204832 |
| Mean | 25.0959323 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.34325014 |
| Sum | 169046.2 |
| Variance | 0.02146526283 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
| Value | Count | Frequency (%) |
| 25 | 4046 | |
| 25.1 | 1152 | 17.1% |
| 25.4 | 863 | 12.8% |
| 25.2 | 319 | 4.7% |
| 25.3 | 280 | 4.2% |
| 25.5 | 76 | 1.1% |
| Value | Count | Frequency (%) |
| 25 | 4046 | |
| 25.1 | 1152 | 17.1% |
| 25.2 | 319 | 4.7% |
| 25.3 | 280 | 4.2% |
| 25.4 | 863 | 12.8% |
| 25.5 | 76 | 1.1% |
| Value | Count | Frequency (%) |
| 25.5 | 76 | 1.1% |
| 25.4 | 863 | 12.8% |
| 25.3 | 280 | 4.2% |
| 25.2 | 319 | 4.7% |
| 25.1 | 1152 | 17.1% |
| 25 | 4046 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.8 KiB |
| 25.0 | |
|---|---|
| 25.1 | |
| 25.2 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26944 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25.2 |
|---|---|
| 2nd row | 25.2 |
| 3rd row | 25.2 |
| 4th row | 25.2 |
| 5th row | 25.2 |
Common Values
| Value | Count | Frequency (%) |
| 25.0 | 3225 | |
| 25.1 | 2904 | |
| 25.2 | 607 | 9.0% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 25.0 | 3225 | |
| 25.1 | 2904 | |
| 25.2 | 607 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 7343 | |
| 5 | 6736 | |
| . | 6736 | |
| 0 | 3225 | |
| 1 | 2904 | 10.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20208 | |
| Other Punctuation | 6736 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7343 | |
| 5 | 6736 | |
| 0 | 3225 | |
| 1 | 2904 | 14.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 7343 | |
| 5 | 6736 | |
| . | 6736 | |
| 0 | 3225 | |
| 1 | 2904 | 10.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 7343 | |
| 5 | 6736 | |
| . | 6736 | |
| 0 | 3225 | |
| 1 | 2904 | 10.8% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.8 KiB |
| 24.6 | |
|---|---|
| 24.7 | |
| 24.4 | |
| 24.5 | 84 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26944 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24.7 |
|---|---|
| 2nd row | 24.7 |
| 3rd row | 24.7 |
| 4th row | 24.7 |
| 5th row | 24.7 |
Common Values
| Value | Count | Frequency (%) |
| 24.6 | 3061 | |
| 24.7 | 2212 | |
| 24.4 | 1379 | |
| 24.5 | 84 | 1.2% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 24.6 | 3061 | |
| 24.7 | 2212 | |
| 24.4 | 1379 | |
| 24.5 | 84 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 8115 | |
| 2 | 6736 | |
| . | 6736 | |
| 6 | 3061 | 11.4% |
| 7 | 2212 | 8.2% |
| 5 | 84 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20208 | |
| Other Punctuation | 6736 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 8115 | |
| 2 | 6736 | |
| 6 | 3061 | 15.1% |
| 7 | 2212 | 10.9% |
| 5 | 84 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 8115 | |
| 2 | 6736 | |
| . | 6736 | |
| 6 | 3061 | 11.4% |
| 7 | 2212 | 8.2% |
| 5 | 84 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 8115 | |
| 2 | 6736 | |
| . | 6736 | |
| 6 | 3061 | 11.4% |
| 7 | 2212 | 8.2% |
| 5 | 84 | 0.3% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.8 KiB |
| 24.8 | |
|---|---|
| 24.7 | |
| 24.9 | 152 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26944 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 24.8 |
|---|---|
| 2nd row | 24.8 |
| 3rd row | 24.8 |
| 4th row | 24.8 |
| 5th row | 24.8 |
Common Values
| Value | Count | Frequency (%) |
| 24.8 | 5750 | |
| 24.7 | 834 | 12.4% |
| 24.9 | 152 | 2.3% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 24.8 | 5750 | |
| 24.7 | 834 | 12.4% |
| 24.9 | 152 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 5750 | |
| 7 | 834 | 3.1% |
| 9 | 152 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20208 | |
| Other Punctuation | 6736 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| 8 | 5750 | |
| 7 | 834 | 4.1% |
| 9 | 152 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6736 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26944 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 5750 | |
| 7 | 834 | 3.1% |
| 9 | 152 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26944 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6736 | |
| 4 | 6736 | |
| . | 6736 | |
| 8 | 5750 | |
| 7 | 834 | 3.1% |
| 9 | 152 | 0.6% |
| Distinct | 71 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.92447855 |
| Minimum | 25.055 |
|---|---|
| Maximum | 28.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 25.055 |
|---|---|
| 5-th percentile | 25.685 |
| Q1 | 26.105 |
| median | 26.735 |
| Q3 | 27.89 |
| 95-th percentile | 28.31 |
| Maximum | 28.73 |
| Range | 3.675 |
| Interquartile range (IQR) | 1.785 |
Descriptive statistics
| Standard deviation | 0.9347115824 |
|---|---|
| Coefficient of variation (CV) | 0.03471605145 |
| Kurtosis | -1.40216905 |
| Mean | 26.92447855 |
| Median Absolute Deviation (MAD) | 0.84 |
| Skewness | 0.1244095758 |
| Sum | 181363.2875 |
| Variance | 0.8736857423 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 27.89 | 430 | 6.4% |
| 26 | 416 | 6.2% |
| 27.995 | 352 | 5.2% |
| 26.105 | 345 | 5.1% |
| 25.79 | 343 | 5.1% |
| 26.21 | 299 | 4.4% |
| 26.525 | 298 | 4.4% |
| 25.895 | 295 | 4.4% |
| 27.785 | 290 | 4.3% |
| 28.1 | 278 | 4.1% |
| Other values (61) | 3390 |
| Value | Count | Frequency (%) |
| 25.055 | 37 | |
| 25.1075 | 1 | < 0.1% |
| 25.16 | 52 | |
| 25.2125 | 1 | < 0.1% |
| 25.265 | 3 | < 0.1% |
| 25.3175 | 1 | < 0.1% |
| 25.37 | 4 | 0.1% |
| 25.4225 | 1 | < 0.1% |
| 25.475 | 55 | |
| 25.5275 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 28.73 | 29 | 0.4% |
| 28.6775 | 4 | 0.1% |
| 28.625 | 20 | 0.3% |
| 28.5725 | 4 | 0.1% |
| 28.52 | 64 | |
| 28.4675 | 8 | 0.1% |
| 28.415 | 95 | |
| 28.3625 | 10 | 0.1% |
| 28.31 | 143 | |
| 28.2575 | 14 | 0.2% |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.60815766 |
| Minimum | 23.9 |
|---|---|
| Maximum | 25.37 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 23.9 |
|---|---|
| 5-th percentile | 24.215 |
| Q1 | 24.32 |
| median | 24.53 |
| Q3 | 24.845 |
| 95-th percentile | 25.16 |
| Maximum | 25.37 |
| Range | 1.47 |
| Interquartile range (IQR) | 0.525 |
Descriptive statistics
| Standard deviation | 0.3272315283 |
|---|---|
| Coefficient of variation (CV) | 0.01329768497 |
| Kurtosis | -0.823613354 |
| Mean | 24.60815766 |
| Median Absolute Deviation (MAD) | 0.21 |
| Skewness | 0.3771913063 |
| Sum | 165760.55 |
| Variance | 0.1070804731 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) |
| 24.425 | 1295 | |
| 24.215 | 873 | |
| 24.32 | 662 | |
| 24.53 | 605 | |
| 24.635 | 557 | |
| 24.95 | 524 | |
| 24.845 | 510 | 7.6% |
| 24.74 | 441 | 6.5% |
| 25.055 | 423 | 6.3% |
| 25.16 | 417 | 6.2% |
| Other values (5) | 429 | 6.4% |
| Value | Count | Frequency (%) |
| 23.9 | 45 | 0.7% |
| 24.005 | 60 | 0.9% |
| 24.11 | 109 | 1.6% |
| 24.215 | 873 | |
| 24.32 | 662 | |
| 24.425 | 1295 | |
| 24.53 | 605 | |
| 24.635 | 557 | |
| 24.74 | 441 | 6.5% |
| 24.845 | 510 | 7.6% |
| Value | Count | Frequency (%) |
| 25.37 | 71 | 1.1% |
| 25.265 | 144 | 2.1% |
| 25.16 | 417 | 6.2% |
| 25.055 | 423 | 6.3% |
| 24.95 | 524 | |
| 24.845 | 510 | 7.6% |
| 24.74 | 441 | 6.5% |
| 24.635 | 557 | |
| 24.53 | 605 | |
| 24.425 | 1295 |
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.34796467 |
| Minimum | 23.375 |
|---|---|
| Maximum | 25.475 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 23.375 |
|---|---|
| 5-th percentile | 23.585 |
| Q1 | 23.795 |
| median | 24.215 |
| Q3 | 24.845 |
| 95-th percentile | 25.16 |
| Maximum | 25.475 |
| Range | 2.1 |
| Interquartile range (IQR) | 1.05 |
Descriptive statistics
| Standard deviation | 0.5610584566 |
|---|---|
| Coefficient of variation (CV) | 0.02304334117 |
| Kurtosis | -1.407614376 |
| Mean | 24.34796467 |
| Median Absolute Deviation (MAD) | 0.525 |
| Skewness | 0.1313485049 |
| Sum | 164007.89 |
| Variance | 0.3147865918 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=21)
| Value | Count | Frequency (%) |
| 23.69 | 725 | |
| 23.795 | 596 | 8.8% |
| 23.9 | 572 | 8.5% |
| 25.055 | 559 | 8.3% |
| 24.635 | 514 | 7.6% |
| 24.11 | 473 | 7.0% |
| 24.74 | 457 | 6.8% |
| 24.845 | 379 | 5.6% |
| 24.95 | 375 | 5.6% |
| 24.005 | 330 | 4.9% |
| Other values (11) | 1756 |
| Value | Count | Frequency (%) |
| 23.375 | 11 | 0.2% |
| 23.48 | 194 | 2.9% |
| 23.585 | 293 | |
| 23.69 | 725 | |
| 23.795 | 596 | |
| 23.9 | 572 | |
| 24.005 | 330 | |
| 24.11 | 473 | |
| 24.215 | 221 | 3.3% |
| 24.32 | 122 | 1.8% |
| Value | Count | Frequency (%) |
| 25.475 | 13 | 0.2% |
| 25.37 | 92 | 1.4% |
| 25.265 | 199 | 3.0% |
| 25.16 | 308 | |
| 25.055 | 559 | |
| 24.95 | 375 | |
| 24.845 | 379 | |
| 24.74 | 457 | |
| 24.635 | 514 | |
| 24.53 | 222 | 3.3% |
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.28901128 |
| Minimum | 23.48 |
|---|---|
| Maximum | 25.16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | 23.48 |
|---|---|
| 5-th percentile | 23.69 |
| Q1 | 24.005 |
| median | 24.215 |
| Q3 | 24.635 |
| 95-th percentile | 24.95 |
| Maximum | 25.16 |
| Range | 1.68 |
| Interquartile range (IQR) | 0.63 |
Descriptive statistics
| Standard deviation | 0.422295775 |
|---|---|
| Coefficient of variation (CV) | 0.01738628922 |
| Kurtosis | -1.07814048 |
| Mean | 24.28901128 |
| Median Absolute Deviation (MAD) | 0.315 |
| Skewness | 0.191670398 |
| Sum | 163610.78 |
| Variance | 0.1783337216 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=17)
| Value | Count | Frequency (%) |
| 24.005 | 762 | |
| 24.215 | 730 | |
| 24.845 | 708 | |
| 24.11 | 688 | |
| 24.635 | 528 | |
| 23.795 | 521 | |
| 23.69 | 497 | |
| 24.53 | 447 | 6.6% |
| 23.9 | 411 | 6.1% |
| 24.74 | 321 | 4.8% |
| Other values (7) | 1123 |
| Value | Count | Frequency (%) |
| 23.48 | 84 | 1.2% |
| 23.585 | 80 | 1.2% |
| 23.69 | 497 | |
| 23.795 | 521 | |
| 23.9 | 411 | |
| 24.005 | 762 | |
| 24.11 | 688 | |
| 24.215 | 730 | |
| 24.32 | 270 | 4.0% |
| 24.425 | 181 | 2.7% |
| Value | Count | Frequency (%) |
| 25.16 | 93 | 1.4% |
| 25.055 | 150 | 2.2% |
| 24.95 | 265 | 3.9% |
| 24.845 | 708 | |
| 24.74 | 321 | |
| 24.635 | 528 | |
| 24.53 | 447 | |
| 24.425 | 181 | 2.7% |
| 24.32 | 270 | 4.0% |
| 24.215 | 730 |
| Distinct | 87 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.941494878 |
| Minimum | -18.281 |
|---|---|
| Maximum | 31.0785 |
| Zeros | 80 |
| Zeros (%) | 1.2% |
| Negative | 768 |
| Negative (%) | 11.4% |
| Memory size | 52.8 KiB |
Quantile statistics
| Minimum | -18.281 |
|---|---|
| 5-th percentile | -7.312 |
| Q1 | 3.656 |
| median | 9.75 |
| Q3 | 10.969 |
| 95-th percentile | 29.25 |
| Maximum | 31.0785 |
| Range | 49.3595 |
| Interquartile range (IQR) | 7.313 |
Descriptive statistics
| Standard deviation | 10.98671019 |
|---|---|
| Coefficient of variation (CV) | 1.105136634 |
| Kurtosis | -0.08688786275 |
| Mean | 9.941494878 |
| Median Absolute Deviation (MAD) | 6.094 |
| Skewness | 0.2600738207 |
| Sum | 66965.9095 |
| Variance | 120.7078008 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 9.75 | 1218 | |
| 2.438 | 459 | 6.8% |
| 8.531 | 390 | 5.8% |
| 3.656 | 374 | 5.6% |
| 28.031 | 374 | 5.6% |
| 10.3595 | 339 | 5.0% |
| 26.813 | 298 | 4.4% |
| 9.1405 | 298 | 4.4% |
| 3.047 | 296 | 4.4% |
| 10.969 | 287 | 4.3% |
| Other values (77) | 2403 |
| Value | Count | Frequency (%) |
| -18.281 | 26 | |
| -17.6715 | 7 | 0.1% |
| -17.062 | 38 | |
| -16.453 | 1 | < 0.1% |
| -15.844 | 45 | |
| -15.2345 | 2 | < 0.1% |
| -14.625 | 35 | |
| -14.0155 | 4 | 0.1% |
| -13.406 | 35 | |
| -12.7965 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 31.0785 | 5 | 0.1% |
| 30.469 | 39 | 0.6% |
| 29.8595 | 75 | 1.1% |
| 29.25 | 270 | |
| 28.6405 | 130 | 1.9% |
| 28.0315 | 1 | < 0.1% |
| 28.031 | 374 | |
| 27.422 | 142 | 2.1% |
| 26.813 | 298 | |
| 26.8125 | 4 | 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| TIME | S | T1 | T2 | T3 | T4 | T5 | T6 | T7 | T8 | T9 | T10 | T11 | T12 | Z | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.000000 | 0 | 25.0 | 24.8 | 24.8 | 25.1 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | 0.000 |
| 1 | 0.083333 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | -1.219 |
| 2 | 0.166667 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | 0.000 |
| 3 | 0.250000 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | -1.219 |
| 4 | 0.333333 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | -1.219 |
| 5 | 0.416667 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | -1.219 |
| 6 | 0.500000 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.690 | 23.48 | -1.219 |
| 7 | 0.583333 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.795 | 23.48 | -1.219 |
| 8 | 0.666667 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.690 | 23.48 | -1.219 |
| 9 | 0.750000 | 0 | 25.0 | 24.8 | 24.8 | 25.0 | 25.1 | 25.2 | 24.7 | 24.8 | 25.055 | 23.9 | 23.690 | 23.48 | -1.219 |
Last rows
| TIME | S | T1 | T2 | T3 | T4 | T5 | T6 | T7 | T8 | T9 | T10 | T11 | T12 | Z | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6726 | 560.500000 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.320 | 24.425 | 23.9 | 6.0940 |
| 6727 | 560.583333 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.320 | 24.425 | 23.9 | 6.0940 |
| 6728 | 560.666667 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.320 | 24.425 | 23.9 | 6.0940 |
| 6729 | 560.750000 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.0940 |
| 6730 | 560.833333 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.0940 |
| 6731 | 560.916667 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.0940 |
| 6732 | 561.000000 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.0940 |
| 6733 | 561.083333 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.7035 |
| 6734 | 561.166667 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 7.3130 |
| 6735 | 561.250000 | 9998 | 25.7 | 24.8 | 24.8 | 25.4 | 25.5 | 25.2 | 24.7 | 24.9 | 26.735 | 24.425 | 24.425 | 23.9 | 6.7035 |